Data Munging
Model builtding
Regular expressions: misspellings, regular expression patterns e https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/
In [ ]:
https://github.com/NathanEpstein/Dora
Feature Selection & Extraction
In [ ]:
In [ ]:
https://github.com/LuminosoInsight/python-ftfy
Given Unicode text, make its representation consistent and possibly less broken.
In [ ]:
In [ ]:
http://nbviewer.jupyter.org/github/rasbt/pattern_classification/blob/master/preprocessing/about_standardization_normalization.ipynb
About Feature Scaling: Z-Score Standardization and Min-Max Scaling
In [ ]:
In [ ]:
https://github.com/NathanEpstein/Dora
data versioning